2 PATENT 

AMENDMENT 

Amendments to the Claims 

Please amend claims 1, 5-12, and 14-15. Please cancel claims 16-19. Please add new 
claims 20-27. The currently pending claims after amendment are listed below. 



1 1 . (Currently Amended) A digital camera, comprising: 

2 a housing; 

3 a digital optical sensing apparatus mounted within said housing, said digital optical sensing 

4 apparatus sensing optical images; 

5 a storage medium for storing digital optical images captured by said digital optical sensing 

6 apparatus; 

7 an acoustic sensor capable of sensing human speech; 

8 a speech reduction apparatus coupled to said acoustic sensor, said speech reduction 

9 apparatus converting human speech sensed by said acoustic sensor to a symbolic text form; and 

10 a controller which stores said symbolic text form in said storage medium in a relationship 

1 1 associated with a captured digital image , wherein said controller: 

12 fa) receives a user indication of a plurality of discrete time intervals; 

13 (h) records a plurality of discrete human speech segments sensed by said acoustic 

14 sensor in respective said discrete time intervals: 

15 (c) causes said speech reduction apparatus to convert each said human speech 

16 segment to a corresponding symbolic text segment: and 

17 (d) automatically associates a respective digital optical image captured by said digital 

18 optical sensing apparatus with each said symbolic text segment based on a temporal 

19 relationship between the time interval in which the discrete human speech segment 

20 corresponding to the symbolic text segment was recorded and the capturing of said digital 

21 optical image . 
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1 2. (Original) The digital camera of claim 1, wherein said controller comprises a 

2 programmable processor executing a control program for controlling the operation of said digital 

3 camera. 

1 3. (Original) The digital camera of claim 2, wherein said speech reduction apparatus 

2 comprises a speech reduction algorithm embodied as a plurality of instructions executable on said 

3 programmable processor. 

1 4. (Original) The digital camera of claim 1, wherein said speech reduction apparatus 

2 converts said human speech sensed by said acoustic sensor to an intermediate symbolic form 

3 comprising a symbolic representation of phonemes, said intermediate symbolic form being 

4 subsequently reduced to natural language text by a separate apparatus. 
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1 5. (Currently Amended) A method of operating a digital camera, comprising the steps of: 

2 capturing a plurality of digital imag e images of an object respective objects of interest with 

3 optical sensing apparatus of said digital camera; 

4 recording a plurality of discrete segments of human speech of a user in said digital camera 

5 during a plurality of respective discrete time intervals , said recording step being performed each 

6 respective discrete time interval occurring substantially contemporaneously with said step of 

7 capturing a of each respective digital image of said plurality of digital images : 

8 rendering each said segment of said plurality of discrete segments of human speech in a 

9 respective corresponding segment of symbolic text form using speech reduction apparatus within 

1 0 said digital camera; and 

11 automatically associating each respective digital image of said plurality of digital images 

12 with a respective corresponding segment of symbolic text rendered from a respective 

13 corresponding segment of human speech based on a temporal relationship between the respective 

14 discrete time interval during which the corresponding segment of human speed was recorded and 

1 5 the capturing of the respective digital image, and storing each said symbolic text form segment in 

16 a relationship associated with each respective said captured digital image. 

1 6. (Currently Amended) The method of operating a digital camera of claim 5, wherein said 

2 step of rendering each said segment of said plurality of discrete segments of human speech in a 

3 respective corresponding segment of symbolic text form converts said human speech to an 

4 intermediate symbolic form comprising a symbolic representation of phonemes, said intermediate 

5 symbolic form being subsequently reduced to natural language text by an apparatus separate from 

6 said digital camera. 
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1 7. (Currently Amended) The method of operating a digital camera of claim 5, wherein said 

2 step of rendering each said segment of said plurality of discrete segments of human speech in a 

3 respective corresponding segment of symbolic text form is performed by a programmable 

4 processor executing a speech reduction program. 

1 8. (Currently Amended) The method of operating a digital camera of claim 7, wherein said 

2 programmable processor further executes a control program for controlling the operation of said 

3 digital camera, and said step of rendering each said segment of said plurality of discrete segments 

4 of human speech in a respective corresponding segment of symbolic text form is performed by 

5 said programmable processor in the background when said control program is otherwise 

6 unoccupied. 
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1 9. (Currently Amended) A program product for controlling the operation of a digital camera, 

2 said program product comprising a plurality of processor executable instructions recorded on 

3 signal-bearing media, wherein said instructions, when executed by at least one programmable 

4 processor within said digital camera, cause the camera to perform the steps of: 

5 capturing a plurality of digital image images of an object respective objects of interest with 

6 optical sensing apparatus of said digital camera; 

7 recording a plurality of discrete segments of human speech of a user in said digital camera 

8 dui ing a plurality of respective discrete time intervals , said re co r ding st e p being p e rform e d each 

9 respective discrete time interval occurring substantially contemporaneously with said st e p of 

1 0 capturing a of each respective digital image of said plurality of digital images : 

1 1 rendering each said segment of said plurality of discrete segments of human speech in a 

12 respective corresponding segment of symbolic text fo r m using speech reduction apparatus within 

1 3 said digital camera; and 

14 associating each respective digital image of said plurality of digital images with a 

15 respective corresponding segment of symbolic text rendered from a respective corresponding 

16 segment of human speech based on a temporal relationship between the respective discrete time 

17 interval during which the corresponding segment of human speech was recorded and the capturing 

1 8 of the respective digital image, and storing each said symbolic text form segment in a relationship 

19 associated with each respective said captured digital image. 



1 10. (Currently Amended) The program product for controlling the operation of a digital 

2 camera of claim 9, wherein said step of rendering each said segment of said plurality of discrete 

3 segments of human speech in a respective corresponding segment of symbolic text form converts 

4 said human speech to an intermediate symbolic form comprising a symbolic representation of 

5 phonemes, said intermediate symbolic form being subsequently reduced to natural language text 

6 by an apparatus separate from said digital camera. 
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1 11. (Currently Amended) A method of recording information with digital images, comprising 

2 the steps of: 

3 capturing at l e ast one a plurality of digital image images of a respective object objects of 

4 interest with optical sensing apparatus of a digital camera; 

5 recording at l e ast on e s e gment a plurality of discrete segments of human speech of a user in 

6 said digital camer a, e ach segment co rre sponding to a re s pe ctive digital imag e , said r e cording step 

7 being performed during a plurality of respective discrete time intervals occurring substantially 

8 contemporaneously with said step of capturing the of each respective digital image of said 

9 plurality of digital images ; 

1 0 rendering each said at least one segment of said plurality of discrete segments of human 

1 1 speech into at least one a respective corresponding segment of symbolic text form using speech 

12 reduction apparatus within said digital camera; 

13 automatically associating each respective digital image of said plurality of digital images 

14 with a respective corresponding segment of symbolic text rendered from a respective 

15 corresponding segment of human speech based on a temporal relationship between the respective 

16 discrete time interval during which the corresponding segment of human speech was recorded and 

17 the capturing of the respective digital image, and recording said association in a memory of said 

18 digital camera: 

19 uploading said at least one digital image and said at least one segment of symbolic text to a 

20 digital image formatting apparatus; and 

2 1 formatting said at l e ast on e plurality of digital trrrage images and said at least one plurality 

22 of s e gment segments of symbolic text for viewing by a user using said digital image formatting 

23 apparatus, wherein each said segment of symbolic text is formatted for viewing in a human 

24 readable form associated with its corresponding digital image. 
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1 12. (Currently Amended) The method of recording information with digital images of claim 

2 11, wherein said step of rendering each said at l e ast on e segment of said plurality of discrete 

3 segments of human speech in a respective corresponding segment of symbolic text form converts 

4 said human speech to an intermediate symbolic form comprising a symbolic representation of 

5 phonemes, and wherein said step of formatting said at l e ast one plurality of digital imag e images 

6 and said at least on e segment plurality of segments of symbolic text for viewing comprises 

7 reducing said intermediate symbolic form to natural language text. 

1 13. (Original) The method of recording information with digital images of claim 1 1 , wherein 

2 said digital image formatting apparatus is a general -purpose digital computer executing a digital 

3 image formatting program. 

1 14. (Currently Amended) The method of recording information with digital images of claim 

2 11, wherein said step of formatting said at least one plurality of digital tntagc images and said at 

3 l e ast one s e gm e nt plurality of segments of symbolic text comprises formatting for output on 

4 paper, wherein formatted text is printed on paper with a corresponding digital image. 

1 15. (Currently Amended) The method of recording information with digital images of claim 

2 11, wherein said step of formatting said at least one plurality of digital image images and said at 

3 least one s e gment plurality of segments of symbolic text comprises formatting for viewing from 

4 an output screen of a digital device, wherein formatted text is displayed on said output screen 

5 with a corresponding digital image. 

16-19. (Cancelled) 
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1 20. (New) The digital camera of claim 1 , wherein said controller associates a respective 

2 digital image with each symbolic text segment according to at least one of the following 

3 association priorities: 

4 (1) if a first digital image is captured during the recording of a human speech segment 

5 corresponding to the symbolic text segment, the symbolic text segment is associated with the first 

6 digital image; 

7 (2) if no digital image is captured from a time the digital camera is powered on until the 

8 end of the recoding of the human speech segment corresponding to the symbolic text segment, 

9 and a second digital image is captured after recording the human speech segment but before the 

1 0 digital camera is powered off, then the symbolic text segment is associated with the second digital 

11 image; and 

12 (3) in all other cases, the symbolic text is associated with the digital image most recently 

13 captured before the recording of the human speech segment corresponding to the symbolic text 

14 segment. 

1 21. (New) The digital camera of claim 20, wherein said controller associates a respective 

2 digital image with each symbolic text segment according to all of said association priorities. 
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1 22. (New) The method of claim 5, wherein said step of automatically associating each 

2 respective digital image of said plurality of digital images with a respective corresponding 

3 segment of symbolic text comprises automatically associating according to at least one of the 

4 following association priorities: 

5 (1) if ^ first digital image is captured during the recording of a human speech segment 

6 corresponding to the symbolic text segment, the symbolic text segment is associated with the first 

7 digital image; 

8 (2) if no digital image is captured from a time the digital camera is powered on until the 

9 end of the recoding of the human speech segment corresponding to the symbolic text segment, 

10 and a second digital image is captured after recording the human speech segment but before the 

1 1 digital camera is powered off, then the symbolic text segment is associated with the second digital 

12 image; and 

13 (3) in all other cases, the symbolic text is associated with the digital image most recently 

14 captured before the recording of the human speech segment corresponding to the symbolic text 

1 5 segment. 

1 23. (New) The method of claim 22, wherein said step of automatically associating each 

2 respective digital image of said plurality of digital images with a respective corresponding 

3 segment of symbolic text comprises automatically associating according to all of said association 

4 priorities. 
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24. (New) The program product of claim 9, wherein said step of associating each respective 
digital image of said plurality of digital images with a respective corresponding segment of 
symbolic text comprises associating according to at least one of the following priorities: 

(1) if a first digital image is captured during the recording of a human speech segment 
corresponding to the symbolic text segment, the symbolic text segment is associated with the first 
digital image; 

(2) if no digital image is captured from a time the digital camera is powered on until the 
end of the recoding of the human speech segment corresponding to the symbolic text segment, 
and a second digital image is captured after recording the human speech segment but before the 
digital camera is powered off, then the symbolic text segment is associated with the second digital 
image; and 

(3) in all other cases, the symbolic text is associated with the digital image most recently 
captured before the recording of the human speech segment corresponding to the symbolic text 
segment. 

25. (New) The program product of claim 24, wherein said step of associating each respective 
digital image of said plurality of digital images with a respective corresponding segment of 
symbolic text comprises associating according to all of said association priorities. 
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1 26. (New) The method of claim 11, wherein said step of automatically associating each 

2 respective digital image of said plurality of digital images with a respective corresponding 

3 segment of symbolic text comprises automatically associating according to at least one of the 

4 following priorities: 

5 (1) if a first digital image is captured during the recording of a human speech segment 

6 corresponding to the symbolic text segment, the symbolic text segment is associated with the first 

7 digital image; 

8 (2) if no digital image is captured from a time the digital camera is powered on until the 

9 end of the recoding of the human speech segment corresponding to the symbolic text segment, 

1 0 and a second digital image is captured after recording the human speech segment but before the 

1 1 digital camera is powered off, then the symbolic text segment is associated with the second digital 

12 image; and 

13 (3) in all other cases, the symbolic text is associated with the digital image most recently 

14 captured before the recording of the human speech segment corresponding to the symbolic text 

1 5 segment. 

1 27. (New) The method of claim 26, wherein said step of automatically associating each 

2 respective digital image of said plurality of digital images with a respective corresponding 

3 segment of symbolic text comprises automatically associating according to all of said association 

4 priorities. 
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